Search CORE

341 research outputs found

MetAMOS: A modular and open source metagenomic assembly and analysis pipeline

Author: Astrovskaya I
Darling AE
Koren S
Liu B
Ondov B
Phillippy AM
Pop M
Sommer DD
Treangen TJ
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

© 2013 Treangen et al. We describe MetAMOS, an open source and modular metagenomic assembly and analysis pipeline. MetAMOS represents an important step towards fully automated metagenomic analysis, starting with next-generation sequencing reads and producing genomic scaffolds, open-reading frames and taxonomic or functional annotations. MetAMOS can aid in reducing assembly errors, commonly encountered when assembling metagenomic samples, and improves taxonomic assignment accuracy while also reducing computational cost. MetAMOS can be downloaded from: https://github.com/treangen/MetAMOS

Crossref

Springer - Publisher Connector

OPUS - University of Technology Sydney

PubMed Central

eScholarship - University of California

Digital Repository at the University of Maryland

Locating a Tree in a Phylogenetic Network in Quadratic Time

Author: BME Moret
G Cardona
IA Kanj
JM Chan
K McBreen
L Iersel van
L Nakhleh
L Parida
L Wang
P Jenkins
T Dagan
T Marcussen
TJ Treangen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/02/2015
Field of study

International audienceA fundamental problem in the study of phylogenetic networks is to determine whether or not a given phylogenetic network contains a given phylogenetic tree. We develop a quadratic-time algorithm for this problem for binary nearly-stable phylogenetic networks. We also show that the number of reticulations in a reticulation visible or nearly stable phylogenetic network is bounded from above by a function linear in the number of taxa

arXiv.org e-Print Archive

Crossref

Hal-Diderot

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

MOSAIC: an online database dedicated to the comparative genomics of bacterial strains at the intra-species level

Author: AC Darling
Annie Gendrault
C Medigue
Christophe Caron
D Halpern
GM Pupo
H Chiapello
Hélène Chiapello
Jérome Blum
KD Pruitt
L Florea
M Hoebeke
M Hohl
M Touchon
Marie-Agnès Petit
Meriem El Karoui
P Kersey
P Stothard
R Chenna
RR Chaudhuri
S Kurtz
TJ Treangen
W Miller
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

BACKGROUND: The recent availability of complete sequences for numerous closely related bacterial genomes opens up new challenges in comparative genomics. Several methods have been developed to align complete genomes at the nucleotide level but their use and the biological interpretation of results are not straightforward. It is therefore necessary to develop new resources to access, analyze, and visualize genome comparisons. DESCRIPTION: Here we present recent developments on MOSAIC, a generalist comparative bacterial genome database. This database provides the bacteriologist community with easy access to comparisons of complete bacterial genomes at the intra-species level. The strategy we developed for comparison allows us to define two types of regions in bacterial genomes: backbone segments (i.e., regions conserved in all compared strains) and variable segments (i.e., regions that are either specific to or variable in one of the aligned genomes). Definition of these segments at the nucleotide level allows precise comparative and evolutionary analyses of both coding and non-coding regions of bacterial genomes. Such work is easily performed using the MOSAIC Web interface, which allows browsing and graphical visualization of genome comparisons. CONCLUSION: The MOSAIC database now includes 493 pairwise comparisons and 35 multiple maximal comparisons representing 78 bacterial species. Genome conserved regions (backbones) and variable segments are presented in various formats for further analysis. A graphical interface allows visualization of aligned genomes and functional annotations. The MOSAIC database is available online at http://genome.jouy.inra.fr/mosaic

Crossref

Springer

Springer - Publisher Connector

edoc

Directory of Open Access Journals

PubMed Central

HAL Descartes

Edinburgh Research Explorer

ProdInra

Hal-Diderot

Plant-RRBS, a bisulfite and next-generation sequencing-based methylome profiling method enriching for coverage of cytosine positions

Author: A Akalin
A Meissner
A Verkest
AJ Bewick
AR Elhamamsy
AR Quinlan
B Langmead
Bram Slabbinck
C Becker
CA Ibarra
Cindy Martens
D Meng
D Pignatta
EJ Finnegan
F Johannes
Frederik Coppens
G Gremme
H Parkinson
H Saze
H Schöb
H Stroud
H Zhang
H-Q Wang
HJ Xie
International Rice Genome Sequencing Project
J Du
J Yu
JI Gent
K Manning
K Okonechnikov
M Block De
M Block De
M Choi
M Hauben
M Schmidt
Magdalena Woloszynska
Marc De Block
Martin Schmidt
MD Schultz
MG Murray
Michiel Van Bel
Mieke Van Lijsebettens
P Cubas
PF Gugger
PS Schnable
R Lister
RJ Schmitz
SE Jacobsen
SJ Cokus
T-F Hsieh
The Arabidopsis Genome Initiative
TJ Treangen
TP Gurp van
W Guo
X Cao
X Chen
X Li
X Wang
X Zhang
ZD Smith
ZL Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Background: Cytosine methylation in plant genomes is important for the regulation of gene transcription and transposon activity. Genome-wide methylomes are studied upon mutation of the DNA methyltransferases, adaptation to environmental stresses or during development. However, from basic biology to breeding programs, there is a need to monitor multiple samples to determine transgenerational methylation inheritance or differential cytosine methylation. Methylome data obtained by sodium hydrogen sulfite (bisulfite)-conversion and next-generation sequencing (NGS) provide genome- wide information on cytosine methylation. However, a profiling method that detects cytosine methylation state dispersed over the genome would allow high-throughput analysis of multiple plant samples with distinct epigenetic signatures. We use specific restriction endonucleases to enrich for cytosine coverage in a bisulfite and NGS-based profiling method, which was compared to whole-genome bisulfite sequencing of the same plant material. Methods: We established an effective methylome profiling method in plants, termed plant-reduced representation bisulfite sequencing (plant-RRBS), using optimized double restriction endonuclease digestion, fragment end repair, adapter ligation, followed by bisulfite conversion, PCR amplification and NGS. We report a performant laboratory protocol and a straightforward bioinformatics data analysis pipeline for plant-RRBS, applicable for any reference-sequenced plant species. Results: As a proof of concept, methylome profiling was performed using an Oryza sativa ssp. indica pure breeding line and a derived epigenetically altered line (epiline). Plant-RRBS detects methylation levels at tens of millions of cytosine positions deduced from bisulfite conversion in multiple samples. To evaluate the method, the coverage of cytosine positions, the intra-line similarity and the differential cytosine methylation levels between the pure breeding line and the epiline were determined. Plant-RRBS reproducibly covers commonly up to one fourth of the cytosine positions in the rice genome when using MspI-DpnII within a group of five biological replicates of a line. The method predominantly detects cytosine methylation in putative promoter regions and not-annotated regions in rice. Conclusions: Plant-RRBS offers high-throughput and broad, genome- dispersed methylation detection by effective read number generation obtained from reproducibly covered genome fractions using optimized endonuclease combinations, facilitating comparative analyses of multi-sample studies for cytosine methylation and transgenerational stability in experimental material and plant breeding populations

Crossref

ZENODO

Ghent University Academic Bibliography

Directory of Open Access Journals

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

GenomeBlast: a web tool for small genome comparison

Author: AL Delcher
DD Womble
DL Swofford
Etsuko N Moriyama
Guoqing Lu
J Felsenstein
JO Korbel
KA Frazer
KP O'Brien
L Florea
Liying Jiang
Luwen Zhang
M Berriman
M Remm
MD Hendy
MG Montague
MM Alba
RD Page
Resa MK Helikar
RL Tatusov
S Kurtz
S Schwartz
S Yang
SF Altschul
T Treangen
T Xie
Thaine W Rowley
TJ Carver
Xianfeng Chen
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Comparative genomics has become an essential approach for identifying homologous gene candidates and their functions, and for studying genome evolution. There are many tools available for genome comparisons. Unfortunately, most of them are not applicable for the identification of unique genes and the inference of phylogenetic relationships in a given set of genomes. RESULTS: GenomeBlast is a Web tool developed for comparative analysis of multiple small genomes. A new parameter called "coverage" was introduced and used along with sequence identity to evaluate global similarity between genes. With GenomeBlast, the following results can be obtained: (1) unique genes in each genome; (2) homologous gene candidates among compared genomes; (3) 2D plots of homologous gene candidates along the all pairwise genome comparisons; and (4) a table of gene presence/absence information and a genome phylogeny. We demonstrated the functions in GenomeBlast with an example of multiple herpesviral genome analysis and illustrated how GenomeBlast is useful for small genome comparison. CONCLUSION: We developed a Web tool for comparative analysis of small genomes, which allows the user not only to identify unique genes and homologous gene candidates among multiple genomes, but also to view their graphical distributions on genomes, and to reconstruct genome phylogeny. GenomeBlast runs on a Linux server with 4 CPUs and 4 GB memory. The online version of GenomeBlast is available to public by using a Web browser with the URL

Crossref

DigitalCommons@University of Nebraska

Springer - Publisher Connector

PubMed Central

The University of Nebraska, Omaha

Rapid in situ imaging and whole genome sequencing of biofilm in neonatal feeding tubes: a clinical proof of concept

Author: A Alkeskas
A Bankevich
A Greenough
A Karaaslan
AI Hidron
B Brooks
BG Mitchell
C Dreszera
C Haisch
C Iversen
CS Cheung
D Huang
D Lebeaux
E Hurrell
E Jimenez
EE Jackson
JN Wilking
JR Mehall
JR Mehall
JW Costerton
KA Kline
KE Holt
L Tóth
LE Hancock
M Gómez
O Holy
RM Donlan
S Bialek-Davenet
SJ Forsythe
SM Petersen
SM Townsend
TJ Treangen
V Martin
W Drexler
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2017
Field of study

The bacterial flora of nasogastric feeding tubes and faecal samples were analysed for a low-birth weight (725g) neonate EGA 25 weeks in intensive care. Samples were collected at age 6 and 8 weeks of life. Optical coherence tomography (OCT) was used to visualise bacterial biofilms inside the nasogastric feeding tubes. The biofilm was heterogeneously distributed along the tube lumen wall, and had a depth of up to 500µm. The bacterial biofilm and faecal samples included Enterococcus faecalis and Enterobacter hormaechei. Representative strains, recovered from both feeding tubes and faecal samples, were whole genome sequenced using Illumina, Mi-Seq, which revealed indistinguishable strains, each with less than 28 SNP differences, of E. faecalis and E. hormaechei. The E. faecalis strains were from two sequence types (ST191 and ST211) and encoded for a number of traits related to biofilm formation (BopD), adherence (Epb pili), virulence (cps loci, gelatinase, SprE) and antibiotic resistances (IsaA, tetM). The E. hormaechei were all ST106, and encoded for blaACT-15 β–lactamase and fosfomycin resistance (fosA). This proof of concept study demonstrates that bacterial flora within the neonatal feeding tubes may influence the bacterial colonisation of the intestinal tract and can be visualised nondestructively using OCT

Crossref

Nottingham Trent Institutional Repository (IRep)

Directory of Open Access Journals

The influence of the accessory genome on bacterial pathogen evolution

Author: Abu-Ali GS
Adiba S
Alfano JR
Arnold DL
Arnold DL
Arnold DL
Asadulghani M
Baharoglu Z
Baquero F
Barash I
Blondel CJ
Blondel CJ
Brüssow H
Cambray G
Chen J
Chen Y
Choi J
Colinon C
Croucher NJ
Dawes FE
De Gelder L
Diard M
Dillon SC
Douard G
Doyle M
Elsaied H
Fondi M
Freeman VJ
Gartemann KH
Gillings MR
Godfrey SAC
Govind R
Greenberg JT
Grillot-Courvalin C
Groisman EA
Guerin E
Hacker J
Hacker J
Halary S
Hassan F
Hazen TH
Hegstad K
Heinemann JA
Heringa S
Hochhut B
Holden MT
Huang L
Imamovic L
Jackson RW
Jackson RW
Jenner C
Joss M
Jové T
Kearney B
Kers JA
Kiiru JN
Koenig JE
Koenig JE
Krauland MG
Landgraf A
Larsson P
Leplae Rl
León G
Lipps HJ
Lloyd AL
Loftie-Eaton W
Lovell HC
Lovell HC
Manning SD
Marchetti M
Matz C
Maurelli AT
Mazel D
Michael CA
Michael CA
Morris CE
Morris CE
Moura A
Naas T
Nadarasah G
Naka H
Nawaz M
Ogura Y
Paauw A
Partridge SR
Pitman A
Poirel L
Poirel L
Ramirez MS
Rankin DJ
Rezzonico F
Rivas LA
Rodriguez-Martinez JM
Rohmer L
Rosewarne CP
Sajjad A
Salanoubat M
Salzberg SL
Seth-Smith H
Shaheen BW
Siguier P
Siguier P
Smillie C
Smith AB
Song H
Sota M
Steinberg KM
Sundin GW
Sundin GW
Tao L
Treangen TJ
van der Meer JR
van der Veen EL
van Essen-Zandbergen A
Wagner A
Waldor MK
Wiesner M
Woodford N
Yang H
Zhou Z
Zupan J
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2011
Field of study

Bacterial pathogens exhibit significant variation in their genomic content of virulence factors. This reflects the abundance of strategies pathogens evolved to infect host organisms by suppressing host immunity. Molecular arms-races have been a strong driving force for the evolution of pathogenicity, with pathogens often encoding overlapping or redundant functions, such as type III protein secretion effectors and hosts encoding ever more sophisticated immune systems. The pathogens’ frequent exposure to other microbes, either in their host or in the environment, provides opportunities for the acquisition or interchange of mobile genetic elements. These DNA elements accessorise the core genome and can play major roles in shaping genome structure and altering the complement of virulence factors. Here, we review the different mobile genetic elements focusing on the more recent discoveries and highlighting their role in shaping bacterial pathogen evolution

Central Archive at the University of Reading

Crossref

PubMed Central

UWE Bristol Research Repository

Academica-e

Read Length and Repeat Resolution: Exploring Prokaryote Genomes Using Next-Generation Sequencing Technologies

Author: AL Delcher
B Haubold
C Fraser
C Kingsford
Claudio U. Köser
D Hernandez
D MacLean
DR Zerbino
DW Bryant
E Mardis
E Mardis
E Stackebrandt
ES Lander
G Achaz
I Maccallum
J Eid
J Shendure
JC Dohm
John A. C. Archer
M Chaisson
M Margulies
M Pop
Matt J. Cahill
MC Wendl
N Hall
N Whiteford
Nicholas E. Ross
O Morozova
RA Farrer
S Kurtz
SF Altschul
SL Salzberg
TJ Treangen
Wenjun Li
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Background: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. Methodology/Principal Findings: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50 % of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. Conclusions: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism unde

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

progressiveMauve: Multiple Genome Alignment with Gene Gain, Loss and Rearrangement

Multiple genome alignment remains a challenging problem. Effects of recombination including rearrangement, segmental duplication, gain, and loss can create a mosaic pattern of homology even among closely related organisms.We describe a new method to align two or more genomes that have undergone rearrangements due to recombination and substantial amounts of segmental gain and loss (flux). We demonstrate that the new method can accurately align regions conserved in some, but not all, of the genomes, an important case not handled by our previous work. The method uses a novel alignment objective score called a sum-of-pairs breakpoint score, which facilitates accurate detection of rearrangement breakpoints when genomes have unequal gene content. We also apply a probabilistic alignment filtering method to remove erroneous alignments of unrelated sequences, which are commonly observed in other genome alignment methods. We describe new metrics for quantifying genome alignment accuracy which measure the quality of rearrangement breakpoint predictions and indel predictions. The new genome alignment algorithm demonstrates high accuracy in situations where genomes have undergone biologically feasible amounts of genome rearrangement, segmental gain and loss. We apply the new algorithm to a set of 23 genomes from the genera Escherichia, Shigella, and Salmonella. Analysis of whole-genome multiple alignments allows us to extend the previously defined concepts of core- and pan-genomes to include not only annotated genes, but also non-coding regions with potential regulatory roles. The 23 enterobacteria have an estimated core-genome of 2.46Mbp conserved among all taxa and a pan-genome of 15.2Mbp. We document substantial population-level variability among these organisms driven by segmental gain and loss. Interestingly, much variability lies in intergenic regions, suggesting that the Enterobacteriacae may exhibit regulatory divergence.The multiple genome alignments generated by our software provide a platform for comparative genomic and population genomic studies. Free, open-source software implementing the described genome alignment approach is available from http://gel.ahabs.wisc.edu/mauve

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

OPUS - University of Technology Sydney

PubMed Central

Insertion Sequence Inversions Mediated by Ectopic Recombination between Terminal Inverted Repeats

Author: A Barzel
Alison Ling
C Feschotte
C Vitte
DJ Hedges
DW Martin
ES Lander
F Yang
G Marais
G Santoyo
HM Arends
J Filee
J Foster
J Parkhill
L Klasson
L Klasson
M Chandler
M Rosenberg
M Wu
Mark A. Batzer
P Siguier
P Siguier
PC Weber
PC Weber
R Belshaw
R Cordaux
R Cordaux
R Cordaux
R Cordaux
Richard Cordaux
S Leclercq
S Pichon
SG Andersson
SL Salzberg
T Wicker
TA Hall
TJ Treangen
WS Reznikoff
Z Nagy
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Transposable elements are widely distributed and diverse in both eukaryotes and prokaryotes, as exemplified by DNA transposons. As a result, they represent a considerable source of genomic variation, for example through ectopic (i.e. non-allelic homologous) recombination events between transposable element copies, resulting in genomic rearrangements. Ectopic recombination may also take place between homologous sequences located within transposable element sequences. DNA transposons are typically bounded by terminal inverted repeats (TIRs). Ectopic recombination between TIRs is expected to result in DNA transposon inversions. However, such inversions have barely been documented. In this study, we report natural inversions of the most common prokaryotic DNA transposons: insertion sequences (IS). We identified natural TIR-TIR recombination-mediated inversions in 9% of IS insertion loci investigated in Wolbachia bacteria, which suggests that recombination between IS TIRs may be a quite common, albeit largely overlooked, source of genomic diversity in bacteria. We suggest that inversions may impede IS survival and proliferation in the host genome by altering transpositional activity. They may also alter genomic instability by modulating the outcome of ectopic recombination events between IS copies in various orientations. This study represents the first report of TIR-TIR recombination within bacterial IS elements and it thereby uncovers a novel mechanism of structural variation for this class of prokaryotic transposable elements

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central